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Abstract 

Write A < B \i there is an injection from A to B, and A x 5 if 
there is a bijection. We give a simple proof that for finite n, n x A < 
n X B implies A < B. From the Cantor-Bernstein theorem it then 
follows that n X A xi n X B implies A B. These results have 
a long and tangled history, of which this paper is meant to be the 
culmination. 

For John 

1 The gist of it 

To show: If there is a one-to-one map from 4 x A to 4 x i? (which need not 
hit all of the range 4x1?), then there is a one-to-one map from A to B. 

A word to the wise: Check out what Rich Schwartz has to say in [5]. 
Proof. Think of 4 x i? as a deck of cards where for each x in B there are 
cards of rank x in each of the four suits spades, hearts, gems, clubs. Note 
that while we use the word ‘rank’, in this game all ranks will be worth the 
same: Who is to say that a king is worth more or less than a queen? 

Think of A as a set of racks, where each rack has four spots to hold cards, 
and think of 4 x A as the set of all the spots in all the racks. 

*The authors hereby waive all copyright and related or neighboring rights to this work, 
and dedicate it to the public domain. This applies worldwide. 
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Think of a one-to-one map from 4Xy4to4xi?asa way to fill the racks 
with cards, so that all the spots have cards, thongh some of the cards may 
not have been dealt out and are still in the deck. 

Name the four spots in each rack by the four suits as they come in bridge, 
with spades on the left, then hearts, gems, clubs. Call a spade ‘good’ if it is 
in the spades spot of its rack, and ‘bad’ if not. 

Do these two rounds in turn. (As you read what is to come, look on to 
where we have worked out a case in point.) 

Shape Up: If a rack has at least one bad spade and no good spade, take 
the spade that is most to the left and swap it to the spades spot so that it 
is now good. Do these swaps all at the same time in all of the hands, so as 
not to have to choose which swap to do hrst. 

Ship Out; Each bad spade has a good spade in its rack, thanks to the 
Shape Up round. Swap each bad spade for the card whose rank is that of 
the good spade in its rack, and whose suit is that of its spot. To see how this 
might go, say that in some rack the queen of spades is in the spades spot, 
while the jack of spades is in the hearts spot. In this case we should swap 
the jack of spades for the queen of hearts. (Take care not to swap it for the 
jack of hearts!) Note that some spades may need to be swapped with cards 
that were left in the deck, but this is fine. Do all these swaps at once, for all 
the bad spades in all the racks. This works since no two bad spades want to 
swap with the same card. 

Note. If you want, you can make it so that when there is more than one 
bad spade in a rack, you ship out just the one that is most to the left. 

Now shape up, ship out, shape up, ship out, and so on. At the end of 
time there will be no bad spades to be seen. (Not that all bad spades will 
have shaped up, or been put back in the deck: Some may be shipped from 
spot to spot through all of time.) Not all the cards in the spades spots need 
be spades, but no card to the right of the spades spot is a spade. So if we 
pay no heed to the spades spots, we see cards of three suits set out in racks 
with three spots each, which shows a one-to-one map from 3 x A to 3 x 5. 

You see how this goes, right? We do a new pass, and get rid of the hearts 
from the last two spots in each rack. Then one last pass and the clubs spots 
have just clubs in them. So the clubs show us a one-to-one map from the set 
A of racks to the set B of ranks. Done. 

Is this clear? It’s true that we have left some things for you to think 
through on your own. You might want to look at [5], where Rich Schwartz 
has put in things that we have left out. 
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Here’s the first pass in a case where all the cards have been dealt out. 
Note that in this case we could stop right here and use the spades to match 
A with B, but that will not work when A and B get big. 
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2 Discussion and plan 

2.1 Division by any finite number 

Write A ^ B (‘A is less than or equal to i?’) if there is an injection from A 
to B. Write A ^ B (‘A equals B\ with apologies to the equals sign) if there 
is a bijection. 

The method we’ve described for dividing by four works fine for any finite 
n, so we have: 

Theorem 1. For any finite n,nxAfi^nxB implies A F B. 

From the Cantor-Bernstein theorem (Prop. [5] below) we then get 

Theorem 2. For any finite n, n x A'^ n x B implies A ^ B. 

As an application of the Theorem [H Lindenbaum and Tarski (cf. [U p. 
305], [H Theorem 13]) proved the following: 

Theorem 3. IfmxA^nxB where gcd(m, n) = 1, then there is some R 
so that A^ n X R and B ^ m x R. 

We reproduce Tarski’s proof in Section 16.21 below. 

Combining Theorems H] and |2] yields this omnibus result for division of 
an equality |8l Corollary 14]: 

Theorem 4. IfmxA^xnxB where gcd(m, n) = d, then there is some R 
so that A X (n/d) x R and B x {m/d) x R. 

2.2 Pan Galactic Division 

We call the shape-up-or-ship-out algorithm for eliminating bad spades Ship¬ 
shaping. As we’ve seen, Shipshaping is the basis for a division algorithm 
that we’ll call Pan Galactic Long Division. As the name suggests, there is 
another algorithm called Pan Galactic Short Division, which we’ll come to 
presently. ‘Pan Galactic’ indicates that we think this is the ‘right way’ to 
approach division, and some dehnite fraction of intelligent life forms in the 
universe will have discovered it. Though if this really is the right way to 
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divide, there should be no need for this Pan Galactic puffery, we should just 
call these algorithms Long Division and Short Division. Which is what we 
will do. 

2.3 What is needed for the proof? 

Shipshaping and its associated division procedures are effective (well-dehned, 
explicit, canonical, equivariant, ...), and do not require the well-ordering 
principle or any other form of the axiom of choice. Nor do we need the 
axiom of power set, which is perhaps even more suspect than the axiom of 
choice. In fact we don’t even need the axiom of inhnity, in essence because 
if there are no inhnite sets that all difficulties vanish. Still weaker systems 
would suffice: It would be great to know just how strong a theory is needed. 

2.4 Whack it in half, twice 

Division by 2 is easy (cf. Section [3]), and hence so is division by 4: 

ix A^ix B => 2x A^2x B A^B. 

We made it hard for ourselves in order to show a method that works for all n. 
It would have been more natural to take n = 3, which is the crucial test case 
for division. If you can divide by 2, and hence by 4, there is no guarantee 
that you can divide by 3; whereas if you can divide by 3, you can divide by 
any n. This is not meant as a formal statement, it’s what you might call a 
Thesis. We chose n = 4 instead of n = 3 because there are four suits in a 
standard deck of cards, and because there is already a paper called ‘Division 
by three’ |2], which this paper is meant to supersede. 

2.5 Plan 

In Section [3] we discuss division by two, and explain why it is fundamentally 
simpler than the general case. In Section 0] we introduce Short Division. In 
Section 0] we take a short break to play Pan Galactic Solitaire. In Section 
|6] we reproduce classical results on subtraction and division, so that this 
work can stand on its own as the dehnitive resource for these results, and 
as preparation for Section [TJ where we discuss how long the Long and Short 
Division algorithms take to run. In Section [8] we discuss the tangled history 
of division. In Section [3 we wrap up with some platitudes. 
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3 Division by two 

Why is division by two easy? The flippant answer is that 2 — 1 = 1. Take a 
look at Conway and Doyle [2] to see one manifestation of this. Here is how 
this shows up in the context of Shipshaping. 

The reason Shipshaping has a Shape Up round is that for n > 2, there 
can be more than one bad spade. When n = 2 there can’t be more than one 
bad spade. In light of this, we can leave out the Shape Up rounds. When a 
bad spade lands in the hearts spot of a rack with a heart in the spades spot, 
we just leave it there. It’s probably easier to understand this if we give up 
the good-bad distinction, and just say that the rule is that when both cards 
in a rack are spades, we ship out the spade in the hearts spot. At the end of 
time, there will be at most one spade in each rack, so in the Long Division 
setup there will be at least one heart; assigning to each rack the rank of the 
rightmost heart gives us an injection from racks to ranks. 

In this approach to division by 2, we find that there is no need to worry 
about doing everything in lockstep. We can do the Ship Out steps in any 
order we please, without organizing the action into rounds. As long as any 
two-spade rack eventually gets attended to, we always arrive at the same hnal 
conhguration of cards. This is the kind of consideration that typically plays 
an important role in the discussion of distributed computation, where you 
want the result not to depend on accidents of what happens hrst. It’s quite 
the opposite of what concerns us here, where, in order to keep everything 
canonical, we can’t simply say, ‘Do these steps in any old order.’ Without 
the axiom of choice, everything has to be done synchronously. 

Now in fact the original Shipshaping algorithm works fine as an asyn¬ 
chronous algorithm for any n, but the limiting conhguation will depend on 
the order in which swaps are carried out, so the result won’t be canonical. 
More to the point, without the Axiom of Choice we can’t just say, ‘Do these 
operations in whatever order you please.’ In contrast to the real world, where 
the ability to do everything synchronously would be a blessing, for us it is 
an absolute necessity. 

4 Short Division 

In Short Division we reverse the roles of A and B, so that A is the set of 
ranks and B the set of racks. An injection from 4 x A to 4 x i? now shows a 
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way to deal out all the cards, leaving no cards in the deck, though some of the 
spots in the racks may remain empty. We do just one round of Shipshaping. 
This works just as before, the only new twist being that if the spades spot 
is empty when we come to shape up a bad spade, we simply move the spade 
over into the spades spot. Since now all the cards have been dealt out, we 
don’t ever have occasion to swap with a card still in the deck. 

When A is hnite, all the spades will shape up, and show a bijection from 
ranks (A) to racks (B). 

When A is inhnite, some of the bad spades may get ‘lost’, having been 
shipped out again and again without ever shaping up. These lost spades will 
each have passed through an inhnite sequence of spots, and all these inhnite 
sequences will be disjoint. We use these sequences to hide the lost spades, as 
follows. 

Let Agood denote the ranks of the good spades at the end of the game, 
and Bgood ^ ^good the racks where they have landed. v4bad = A — Agood is 
the the set of ranks of the lost spades. To each element of Abad there is an 
inhnite chain of spots in 3 x Bg^od, and these chains are all disjoint. Using 
these disjoint chains we can use the usual ‘Hilbert Hotel’ scheme to dehne a 
bijection between Hbad U 3 x Bgood and 3 x Hgood, i-e. we make 3 x Bgood 
‘swallow’ Hbad- But if 3 x Hgood can swallow Hbad then so can Hgood (cf. 
Proposition [13] below): Hbad U Bgood ^ -Bgood- So 

A ^bad U ^good ^bad U ^good -Bgood B. 

Note. To make apparent the sequences of cards accumulated by the lost 
spades, we can modify the game by making stacks of cards accumulate under 
the spades, to record their progress. The Shape Up round is unchanged, 
except that we swap the whole spade stack into the spades spot. In the Ship 
Out round, when a spade ships out it takes its stack with it, and places it 
on top of the card it was meant to swap with. Spades that eventually shape 
up will have hnite piles beneath them, but lost spades will accumulate an 
inhnite stack of cards. 


5 Pan Galactic Solitaire 

Let us take a short break to play Pan Galactic Solitaire. 

Deal the standard 52-card deck out in four rows of 13 cards. The columns 
represent the racks, with spots in each rack labelled spades, hearts, diamonds. 
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clubs going from top to bottom. (Though as you’ll see it makes no difference 
how we label the spots, this game is suit-symmetric.) The object is to ‘£x’ 
each column so that the ranks of the four cards are equal and each card is in 
the suit-appropriate spot. We move one card at a time. There is no shaping 
up; shipping out swaps are allowed based on any suit, not just spades. So 
for example if in some column the 3 of hearts is in the hearts spot and the 6 
of hearts is in the diamonds spot, we may swap the 6 of hearts for the 3 of 
diamonds. 

We don’t know a good strategy for this game. Computer simulations 
show that various simple strategies yield a probability of winning of about 
1.2 per cent. Younger players hnd this frustratingly low, and either play 
with a smaller deck or allow various kinds of cheating. Older players are 
not put off by the challenge, and at least one has played the game often 
enough to have won twice. Though because the game recovers robustly from 
an occasional error, he cannot be certain that he won these games fair and 
square. 

A couple of apps have been written to implement this game on the com¬ 
puter. In some versions if you click on the 6 of hearts (as in the example 
above) the app locates the 3 of diamonds for you and carries out the swap. 
This makes the game go much faster, but it is much less fun to play than if 
you must locate and click on the 3 of diamonds, or better yet, drag the 6 of 
hearts over onto the 3 of diamonds. One theory as to why the automated 
version of the game is less fun to play is that the faster you can play the 
game, the more frequently you lose. 

This game is called Pan Galactic Solitaire from the conviction that some¬ 
thing like it will have occurred to a dehnite fraction of all life forms that have 
discovered Short and Long Division. 


6 Cancellation laws 

In this section we reproduce basic results about cancellation. These results 
are all cribbed from Tarski [S], though the notation is new and (we hope) 
improved. 

To simplify notation, write -|- for disjoint union and nA for n x A. (This 
abbreviation is long overdue.) A — B will mean set theoretic complement, 
with the implication that i? is a subset of A. 

The results below guarantee the existence of certain injections and bijec- 


tions, and the proofs are backed by algorithms. For hnite sets these can all 
be made to rnn snappily. 

6.1 Subtraction laws 

The only ingredients here are the Cantor-Bernstein constrnction and the 
closely related Hilbert Hotel constrnction. 

We start with Cantor-Bernstein. 

Proposition 5 (Cantor-Bernstein). 

B A B ^ A A^ B. 

Proof. For a proof, draw the pictnre and follow yonr nose (cf. |2]). 

Here is a version of the proof emphasizing that the desired bijection is 
the pointwise limit of a seqnence of hnitely-compntable bijections, which will 
be important to ns in Section [7] below. 

It snffices to show that from an injection 

f :A^ B CA 


we can get a bijection. 

Say we have any fnnction f : A ^ B, not necessarily injective (this 
relaxation of the conditions is nsefnl, so that we can think of hnite examples). 
Every a G H makes a Valentine card. To start, every a & A — B gives their 
Valentine to /(a) G B. Any b E B that gets a Valentine then gives their 
own Valentine to f{b). Repeat this procednre ad inhnitnm, and at the end 
of the day every b E B has at least one Valentine, and every Valentine has a 
well-dehned location (it has moved at most once). Let g associate to a G A 
the b E B that has a’s Valentine, g is always a snrjection, and if / is an 
injection, is a bijection. 

Here’s a variation on the proof. Again each a E A makes a Valentine, 
only this time every a E A gives their Valentine to /(a). Now any b E B 
that has no Valentine demands its Valentine back. Repeat this clawing-back 
ad inhnitnm, and at the end of the day, every Valentine has a well-dehned 
location, and if / was injective, or more generally if / was hnite-to-one, every 
b E B has a Valentine, and we’re done as before. The twist here is that if 
/ is not hnite-to-one, at the end of the day some 6’s may be left withont 
a Valentine. So they demand their Valentine back, continning a transhnite 
chain of clawings-back. We may or may not be comfortable conclnding that 
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after some transfinite time, every b G B will have a Valentine. For present 
purposes we needn’t worry about this, since when / is injective the fuss is 
all over after at most u steps. □ 

Notation. Write A B {'A is swallowed by B' or 'A hides in i?’) if 
A + B ^ B. By Cantor-Bernstein this is equivalent to A + 5 x 5. Another 
very useful equivalent condition is that there exist disjoint inhnite chains 
inside B, one for each element of A. 

By repeated swallowing we have: 

Proposition 6. For any n, if 


Ai ^ B, i = 0,..., n — 1 

then 

Aq + ... + An—I B. D 

Here are two close relatives of Cantor-Bernstein, proved by the same 
back-and-forth construction. 

Proposition 7. 

A + C FB + C ^ A-AoFB, 

where Aq <^C. □ 

Proposition 8. 

A + C^B + C ^ A-Aq^B-Bq, 

where Aq, Bq C. □ 

Proposition 9. 

A + C FB + 2C AFB + C 

Proof. 

A + C F B + C + C ^ A-AqFB + C 
with Aq C. So 

A = {A - Aq) + Aq F B + C + Aq F B + C. □ 
Proposition 10. For finite m <n, 

A + mC F B + nC A F B + {n — m)C. 
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Proof. The proof is by induction, and we can get the number of recursive 
steps down to 0(log(m)). □ 

From this we get 

Proposition 11. For n > 1 

A + nCFB + nC=>A + CFB + C. □ 

From Cantor-Bernstein we then get 
Proposition 12. For n> 1 

A + nC^B + nC^A + C^B + C. □ 

Here’s the key result for Short Division: 

Proposition 13. For n > 1 

A<^nC A<^C. 

Proof. This is the special case of Proposition [TT] when B is empty. Here we 
redo the proof in this special case, in preparation for the timing considerations 
of Section [71 

Think of nC as a deck of cards, where (i,c) represents a card of suit i 
and rank c. Since A <C nC, there are disjoint inhnite chains 

Sa : cj —)■ nC, a E A. 

Let a{a) to be the smallest i such that Sa contains infinitely many cards 
of suit i, and let pa be the sequence of ranks of those cards. (For future 
reference, note that we could trim this down to the ranks of those cards of 
suit a (a) that come after the last card of a lower-numbered suit.) 

Let 

Ai = {a E A : a{a) = i}. 

The infinite chains 

Pa ■ uj ^ C, a E Ai 

are disioint, so 

Ai^C 

So by Proposition |6l 


A — Aq Afi—i C. 
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The required injection from A + C io C is obtained as a composition of 
injections fi which map A + C onto A — Ai + C, leaving A — Ai hxed. □ 
Here’s a result that will be handy when we come to the Euclidean algo¬ 
rithm. 

Proposition 14. For m < n 

A -h mC X nC H -|- E x (n — m)C, 
where E nC, and hence E C. 

Proof. From above we have 


AF {n — m)C. 


Write 


H -I- E X (n — m)C, 


so that 

A + E + mC X nC. 

Since A + mC x nC this gives 

E + nC X nC E nC E <^C. □ 

Finally, here’s the result Conway and Doyle needed to make their division 
method work. 

Proposition 15. For n>l, 


nA :< nB A B F A A F B, 


and hence by Cantor-Bernstein A^ B. 

Proof. Write 


From 


A^B + C. 


and 


nA -< nB 


A = B + C 
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we get 


nB + nC ^ nB, 


i.e. 


nC nB. 


But 


nC <C nB 


C -C nB 


C^B, 


so 

B + C ^ B. □ 


6.2 Division laws 

Finally we come to division. We’ve already proved Theorems [1] and |2l All 
that remains now is Theorem |3l 

Proof of Theorem [3l As you would expect, the proof is a manifestation 
of the Euclidean algorithm. If m = 1 we are done. Otherwise we will use the 
usual recursion. 


mB -< nB x mA. 


Using division we get 

B<A. 

Write 

A^B + C. 

mB + mC X mA x nB. 

From Lemma [H] we have 


mC + E X (n — m)B, 

with E B, and hence E A. We think of E as ‘practically empty’. If it 
were actually empty, we’d recur using C in place of A. Ditto if we knew that 
E C. As it is we use C + E in the recursion, hoping that this amounts to 
practically the same thing. 

m{C + E) X mC + E + {m — 1)E x (n — m)B + (m — 1)E x (n — m)B. 
So by induction, for some R we have 

U + E X (n — m)R, 
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B X mR. 


Since E ^ A, 

A'^A + E'^B + C + E'^ mR + (n — m)R x nR 


Done. 

We won’t undertake to determine the best possible running time here. 
But in order to make sure it requires at most a logarithmic number of di¬ 
visions, we will want to check that for the recursion we can subtract any 
multiple km from n as along as km < n. Here’s the argument again, in 
abbreviated form. 

kmB ■< nB x mH; 
kB ^ A; 

A^kB + C- 
mkB + mC x mA x nB] 
mC + E ^ {n — km)B, E ^ B ^ A] 

m(C -I- i?) X mC + E + {m — 1)E x (n — km)B + {m — 1)E x (n — km)B; 

C + E {n — km)R, B x mR; 

H X H -I- E X kB + C + E kmR + {n — km)R x nR. □ 

7 How long does it take? 

7.1 The finite case 

In Shipshaping, every swap puts at least one card in a spot where it will stay 
for good, so the number of swaps is at most n\A\\ This holds both in the 
Long Division setup where nA is the set of spots, and in the Short Division 
setup where nA is the set of cards. For a distributed process where all Shape 
Up and Ship Out rounds take place simultaneously the number of rounds 
could still be nearly this big, despite the parallelism, because one bad spade 
could snake its way through nearly all the non-spades spots. If we simulate 
this distributed process on a machine with one processor, the running time 
will be 0(n|H|) (or maybe a little longer, depending on how much you charge 
for various operations). 
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Note. While \B\ might be as large as n\A\, nothing requires us to allocate 
storage for all n\B\ cards (in Long Division) or spots (in Short Divison). 

For hnite A, in Short Division all spades will shape up after one pass of 
Shipshaping, and show an injection from ranks to racks. So the running time 
for Short Division is 0{n\A\), running either as a distributed process or on 
a single processor. This is as good as we could hope for. 

Still for hnite A, Long Division with the naive recursion takes n — 1 
passes of Shipshaping. If we divide by 2 whenever an intermediate value of 
n is even, we can get this down to at most 0(log(n)) passes. The number 
of suits remaining gets halved at least once every other pass, so the total 
number of swaps over all rounds of Shipshaping will be at most 

\A\{n + n + n/2 + n/2 + n/A + n/A + ...) = An\A\. 

Hence the total time for Long Division is 0{n\A\), running either as a dis¬ 
tributed process or on a single-processor machine. This is the same order 
as for Short Division, though Short Division will win out when you look at 
explicit bounds. 

7.2 The infinite case 

For A inhnite, to talk about running times we will need a notion of transhnite 
synchronous distributed computation. The general idea is to support taking 
the kind of pointwise limits that show up in Shipshaping, where in the limit 
the contents of each spot is well dehned, as is the goodness (but not the 
location) of each spade. (See [7] below for more specihcs.) 

One round of Shipshaping will take time u. For Long Division sped up 
as in the hnite case so that as to take 0(log(?7,)) passes, the running time will 
be u ■ 0{\og{n)). 

Note. Here and throughout, we’ll be rounding running times down to 
the nearest limit ordinal, so as not to have to worry about some hnite number 
of post-processing steps. 

For real speed, we’ll want to use Short Division. The swallowing step can 
be implemented by a recursion which, like Long Division, can be sped up to 
take 0(log(?7,)) passes. Though the number of passes is on the same order as 
for Long Division, this can still be considered an improvement, to the extent 
that Shipshaping is more complicated than swallowing. 

The big advantage of Short Division stems from the fact that the swal¬ 
lowing can be sped up to run in time u, in essence by running the steps 
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of the recursion in tandem. And the swallowing can be conhgured to run 
simultaneously with the Shipshaping. Sped up in this way, Short Division 
can be done in time at most u. We discuss this further in Section [3 below. 

By contrast, we’ve never found a way to run the division stages of Long 
Division in tandem. 

7.3 Dividing a bijection 

To divide a bijection to get a bijection, we can simultaneously compute injec¬ 
tions each way, and then combine them using the Cantor-Bernstein construc¬ 
tion in additional time uj. (Cf. Proposition El) So if dividing an injection 
takes time cu, dividing a bijection takes time at most u: ■ 2. 

Can this be improved upon? It is tempting to start running the Cantor- 
Bernstein algorithm using partial information about the two injections being 
computed, but we haven’t made this work. The case to look at hrst is n = 2, 
where Long Division is all done after one pass of Shipshaping. 

7.4 Speeding up Long Division 

A division algorithm takes as input an injection /„ : nA —>■ nB, and produces 
as output an injection In the Long Division setup, where all the 

spots are hlled though not all the cards need have been dealt out, one pass of 
Shipshaping gets us from /„ to fn-i, or more generally, from /„ to fn(k-i)/k, 
for any k dividing n. In particular, when n is even one pass gets us fn/ 2 - 
This allows us to get from /„ to /i in 0(log(n)) passes. As one Shipshaping 
pass takes time at most cu, this makes for a total running time of at most 
u ■ 0(log(n)). 

Various tricks can be used to cut down the number of passes in Long 
Division. We can run Shipshaping using any divisor of n we please, or better 
yet, run it simultaneously for all divisors. Knowing fm for as many values 
of m as possible can be useful because if we know injections from rukA to 
rrikB then we can paste them together to get an injection from mA to mB 
for any positive linear combination m = Y^k'^k^k- This pasting takes only 
hnite time, which in this context counts as no time at all. Combining these 
observations we can shave down the number of Shipshaping passes needed, 
and in the process we observe some intriguing phenomena. But we can never 
get the number of passes below flog 2 u], which is at most a factor of two 
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better than what we achieve with the naive method of dividing by 2 when 
any intermediate n is even. 

7.5 Speeding up Short Division 

For real speed, we nse Short Division. In this setnp, all the cards are dealt 
ont, thongh not all the spots need be hlled. As the Shipshaping algorithm 
rnns, we observe a steadily decreasing collection of bad spades, together 
with steadily lengthening disjoint seqnences in (n — {0}) x B telling the 
spots throngh which these bad spades have passed. In the limit, the bad 
spades that remain have wandered forever, and they index disjoint injective 
seqnences in (n — {0}) x B. 

The proof of Proposition [11] offers a recnrsive algorithm for hiding the 
lost bad spades in with the good, which when sped np in the by-now nsnal 
way reqnires 0(log(?7,)) recnrsive ‘passes’, which if carried ont one after the 
other, give ns a total rnnning time of oj ■ 0(log(n)). (We’ve silently absorbed 
the u coming from the single pass of Shipshaping.) This is on the same order 
as what we achieved with Long Division. 

Having to do only one ronnd of Shipshaping conld be viewed as an im¬ 
provement, on the gronnds that a Shipshaping is more complicated than a 
swallowing pass. Bnt as we’ve stated before, the real advantage of Short 
Division come from the fact that we can rnn all the swallowing passes in 
tandem with each other, and with the Shipshaping algorithm. 

Here’s ronghly how it works. As we rnn the Shipshaping pass for Short 
Division, the set of bad spades decreases, while the seqnences of spots they 
have visited steadily increases. As these preliminary results trickle in, we 
can be computing how we propose to hide the shrinking set of bad spades 
among the growing set of good spades. Eventually every bad spade knows 
where it will go, as does every good spade. In the limit we have an injection 
from spades, and hence ranks, to racks. The whole procedure is done after 
only OJ steps (though as always we reserve the right to do some hxed hnite 
number of post-processing steps). 

The main thing missing here is how we end up hiding the lost spades, and 
how we compute this. The fundamental idea is to trim the sequence of spots 
visited by a lost spade down to the subsequence consisting of all the hearts 
visited, then all the diamonds after the last heart, then all the spades after 
the last diamond. These trimmed sequences can be computed in tandem 
with Shipshaping, so they are ready for use after time u. (See Section 17.61 
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below.) 

Now because these sequences are increasing we can determine the limiting 
suit of all these sequences by running along them, keeping track of the suit, 
which eventually stops changing. After another u steps we’ve computed 
what in the notation of Proposition [13] was the function a. Then we can 
distinguish the lost spades according to this limiting value. We hrst hide 
those that limit with hearts, then diamonds, then clubs. This hiding takes 
only 0{n) post-processing steps, which we disregard, for a total running time 
of a; • 2: a; to compute and trim the tracks of the lost spades; uj to determine 
the limiting suits of the lost spades. 

To get the running time down to uj is trickier. The idea is that we 
start using the trimmed sequences before we are done computing them. The 
injection we compute will be different from that just described, because of 
artifacts associated to the cards in the trimmed sequences that come before 
those of the limiting suit (e.g. a hnite number of hearts coming before an 
inhnite number of diamonds). We omit the details. 

7.6 Chipshaping 

Here is a variation on Shipshaping that incorporates the simultaneous deter¬ 
mination of the trimmed sequences of spots that the lost spades have passed 
through. 

In this variation, we begin by placing a poker chip on top of each spade, 
which will serve as its representative during the Shape Up and Ship Out 
rounds. (If the racks are tilted, we’ll have to lean the chip against the card, 
maybe it would be better to think of the cards laid out in rows as in Pan 
Galactic Solitaire.) When we shape up a chip we move the card and chip 
together, but when we ship out we move only the chip. We add a third round 
called Trim: If a card has a chip on it, and the card is a spot to the left of 
the spot for its suit (e.g. a club in a hearts spot), we leave the chip where 
it is, and swap the card underneath the chip to where it belongs, meaning 
the spot for its suit in the rack having the spade of its rank in the spades 
spot. (If the chip has just moved, this is where the chip came from; in any 
case it is someplace the chip has visited before.) We repeat the Trim round 
until no more moves are possible, meaning that no card that is topped by 
a chip is ‘above its station’. This takes only a hnite number of rounds (at 
most equal to the number of rounds Ship Out rounds we have done). Then 
we continue: Shape Up, Ship Out, Trim, Shape Up, Ship Out, Trim, Trim, 
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Shape Up, Ship Out, Trim, Trim, Trim,... 

If you try this, you’ll see that it is really quite nice, though it is annoying 
that we might have to wait through an ever larger number of Trim rounds. 
An alternative is to do just one Trim round, but this entails—well, try it, 
and you’ll see. 

In any event, we aren’t quite computing everything we’ll need in order to 
see an injection from A to i? in the limit after only oj steps. For that, it seems 
that we might have to go beyond what you can conveniently do with just 
the original deck of cards and some chips. Like, say, add some local memory 
for pointers, maybe in the form of stickers affixed to cards or racks. There’s 
nothing wrong with this from the point of view of distributed synchronous 
computation. It just won’t be as much fun. 

7.7 Transfinite synchronous distributed computation 

We’re using here a loose notion of transhnite computation. Roughly speak¬ 
ing, we’re imagining that we have a processor for each element of A and 
B, or maybe (for convenience) for each element of n x A and n x B. Each 
processor has some hnite set of flags, a hnite set of registers that can store 
the name of an element of A or B, and a hnite set of ports. The processors 
can communicate by sending messages to a designated port of another pro¬ 
cessor; if two processors simultaneously attempt to send to the same port of 
another processor, the whole computation crashes. We allow processors to 
set the hag of another processor; if two or more processors simultaneously 
try set the same hag, there is no conhict. Certain hags and registers may be 
designated as suitable for reading at limit times u,u ■ 2,.... If the state of 
one of these hags or the contents of one of these registers fails to take on a 
limiting value, the whole computation crashes. 

This is as precise as we want to get here. Really, we’re hoping to hnd 
that a suitable formulation has already been made and explored. It would be 
great if there is essentially one suitable formulation, but we are by no means 
certain of this. 


8 Some history 

Here’s a brief rundown on the history of Theorems [1] and [2l 
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Theorems [Hand [2] follow easily from the well-ordering principle, since then 
n X A ^ A when A is inhnite. The well-ordering principle is a consequence 
of the power set axiom and the axiom of choice (which without the power 
set axiom may take several inequivalent forms). The effective proofs we are 
interested in don’t use either axiom. 

Briefly put, Bernstein stated Theorem 2 in 1905 and gave a proof for 
n = 2, but nobody could make sense of what he had written about extending 
the proof to the general case. In the 20’s Lindenbaum and Tarski found 
proofs of Theorems 2 and 1 but didn’t publish them. Lindenbaum died and 
Tarski forgot how their proofs went. In the 40’s Tarski found and published 
two new proofs of Theorem 1. In the 90’s Conway and Doyle found a proof, 
after hnally peeking at Tarski [S]; based on what Tarski had written, they 
decided that their proof was probably essentially the same as Lindenbaum 
and Tarski’s lost proof. 

For the gory details, here is Tarski |8], from 1949. (We’ll change his 
notation slightly to match that used here.) 

Theorem 2 for n = 2 was hrst proved by F. Bernstein in pp. 

122 ff.]; for the general case Bernstein gave only a rough outline 
of a proof, the understanding of which presents some difficulties. 
Another very elegant proof of Theorem 2 in the case n = 2 was 
published later by W. Sierpinski [6]; and a proof in the general 
case was found, but not published, by the late A. Lindenbaum [H 
p. 305]. Theorem 1 — from which Theorem 2 can obviously be 
derived by means of the Cantor-Bernstein equivalence theorem 
— was hrst obtained for n = 2 hj myself, and then extended to 
the general case by Lindenbaum |H p. 305]; the proofs, however, 
were not published. Recently Sierpinski [7] has published a proof 
of Theorem 1 for n = 2. 

A few years ago 1 found two different proofs of Theorem 1 
(and hence also, indirectly, of Theorem 2). ... The second proof 
is just the one which 1 should like to present in this paper. It 
is in a sense an extension of the original proof given by me for 
n = 2, and is undoubtedly related to Lindenbaum’s proof for the 
general case. Unfortunately, 1 am not in a position to state how 
close this relation is. The only facts concerning Lindenbaum’s 
proof which 1 clearly remember are the following: the proof was 
based on a weaker though related result previously obtained by 
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me, which will be given below ...; the idea used in an important 
part of the proof was rather similar to the one used by Sierpinski 
in the above-mentioned proof of Theorem 2 for n = 2. Both 
of these facts apply as well to the proof I am going to outline. 

On the other hand, my proof will be based upon a lemma ..., 
which seems to be a new result and which may present some 
interest in itself; it is, however, by no means excluded that the 
proof of this lemma could have been easily obtained by analyzing 
Lindenbaum’s argument 

Observe Tarski’s delicate description of Bernstein’s attempts at the gen¬ 
eral case. Conway and Doyle [2] were less circumspect: 

We are not aware of anyone other than Bernstein himself who 
ever claimed to understand the argument. 

Around 2010 Arie Hinkis claimed to understand Bernstein’s argument 
(see 0). Peter proposed to Cecil that he look into this claim for his Bar- 
month senior thesis project. Between 2011 to 2013 we spent many long hours 
trying to understand Hinkis’s retelling of Bernstein’s proof, and exploring 
variations on it. In the end, we hit upon the proofs given here. These proofs 
are very much in the spirit of Bernstein’s approach, but not close enough that 
we believe that Bernstein knew anything very like this, and we remain skep¬ 
tical that Bernstein ever knew a correct proof. There are very many possible 
variations of Bernstein’s general idea, a lot of which seem to almost work. 
Still, these two proofs can be seen as a vindication of Bernstein’s approach. 
Our finding them owes everything to Hinkis’s faith in Bernstein. 

9 Valediction 

Tarski [8l p. 78] wrote that ‘an effective proof of [Theorems [T] and |2] is by no 
means simple.’ We hope that you will disagree, if not from this treatment 
then from Rich Schwartz’s enchanting presentation in [5]. 

The reason that division can wind up seeming simple to us is that combi¬ 
natorial arguments and algorithms are second nature to us now. A different 
way of looking at the problem makes its apparent difficulties disappear. 

For an even more persuasive example of this, consider the Cantor-Bernstein 
theorem, and the confusion that accompanied it when it was new. From a 
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modern perspective, you just combine the digraphs associated to the two 
injections and follow your nose. 

After a century of combinatorics and computer science, it’s easy to un¬ 
derstand how Pan Galactic Division works. Can we hope that someday folks 
will marvel at how hard it was to discover it? 
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